A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users
نویسندگان
چکیده
In this paper a multi-channel speech enhancement framework for distant speech acquisition in noisy and reverberant environments for Non-negative Matrix Factorization (NMF)-based Automatic Speech Recognition (ASR) is proposed. The system is evaluated for its use in an assistive vocal interface for physically impaired and speech-impaired users. The framework utilises the Spatially Pre-processed Speech Distortion Weighted Multichannel Wiener Filter (SP-SDW-MWF) in combination with a postfilter to reduce noise and reverberation. Additionally, the estimation uncertainty of the speech enhancement framework is propagated through the Mel-Frequency Cepstrum Coefficients (MFCC) feature extraction to allow for feature compensation in a later stage. Results indicate that a) using a trade-off parameter between noise reduction and speech distortion has a positive effect on the recognition performance with respect to the well-known GSC and MWF and b) the addition of a postfilter and the feature compensation increases performance with respect to several baselines for a non-pathological and pathological speaker.
منابع مشابه
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition
We propose a novel acoustic beamforming method using blind source separation (BSS) techniques based on non-negative matrix factorization (NMF). In conventional mask-based approaches, hard or soft masks are estimated and beamforming is performed using speech and noise spatial covariance matrices calculated from masked noisy observations, but the phase information of the target speech is not adeq...
متن کاملMulti-Constraint Nonnegative Matrix Factorization Approach to Speech Enhancement with Nonstationary Noise
The enhancement of speech degraded by nonstationary noises and low signal-to-noise ratio (SNR) conditions is a high demanding but challenging task. We present a robust and effective single channel speech enhancement algorithm under the framework of Nonnegative Matrix Factorization (NMF). Considering the sparse property of speech and low-rank property of nonstationary noise, a new formulation fo...
متن کاملSingle-channel speech enhancement based on non-negative matrix factorization and online noise adaptation
In this paper, we demonstrate a simulator for real-time speech enhancement based on a non-negative matrix factorization (NMF) technique. In particular, we propose an online noise adaptation method in an NMF framework, which is activated during non-speech intervals and used for adapting noise bases for NMF. Thus, incoming noisy speech is decomposed by using such adapted noise bases and universal...
متن کاملA New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملSpeech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering
This paper presents new adaptive filtering techniques used in speech enhancement system. Adaptive filtering schemes are subjected to different trade-offs regarding their steady-state misadjustment, speed of convergence, and tracking performance. Fractional Least-Mean-Square (FLMS) is a new adaptive algorithm which has better performance than the conventional LMS algorithm. Normalization of LMS ...
متن کامل